469 research outputs found
On the effective deployment of current machine translation technology
Machine translation is a fundamental technology that is gaining more importance
each day in our multilingual society. Companies and particulars are
turning their attention to machine translation since it dramatically cuts down
their expenses on translation and interpreting. However, the output of current
machine translation systems is still far from the quality of translations generated
by human experts. The overall goal of this thesis is to narrow down
this quality gap by developing new methodologies and tools that improve the
broader and more efficient deployment of machine translation technology.
We start by proposing a new technique to improve the quality of the
translations generated by fully-automatic machine translation systems. The
key insight of our approach is that different translation systems, implementing
different approaches and technologies, can exhibit different strengths and
limitations. Therefore, a proper combination of the outputs of such different
systems has the potential to produce translations of improved quality.
We present minimum BayesÂż risk system combination, an automatic approach
that detects the best parts of the candidate translations and combines them
to generate a consensus translation that is optimal with respect to a particular
performance metric. We thoroughly describe the formalization of our
approach as a weighted ensemble of probability distributions and provide efficient
algorithms to obtain the optimal consensus translation according to the
widespread BLEU score. Empirical results show that the proposed approach
is indeed able to generate statistically better translations than the provided
candidates. Compared to other state-of-the-art systems combination methods,
our approach reports similar performance not requiring any additional data
but the candidate translations.
Then, we focus our attention on how to improve the utility of automatic
translations for the end-user of the system. Since automatic translations are
not perfect, a desirable feature of machine translation systems is the ability
to predict at run-time the quality of the generated translations. Quality estimation
is usually addressed as a regression problem where a quality score
is predicted from a set of features that represents the translation. However, although the concept of translation quality is intuitively clear, there is no
consensus on which are the features that actually account for it. As a consequence,
quality estimation systems for machine translation have to utilize
a large number of weak features to predict translation quality. This involves
several learning problems related to feature collinearity and ambiguity, and
due to the ÂżcurseÂż of dimensionality. We address these challenges by adopting
a two-step training methodology. First, a dimensionality reduction method
computes, from the original features, the reduced set of features that better
explains translation quality. Then, a prediction model is built from this
reduced set to finally predict the quality score. We study various reduction
methods previously used in the literature and propose two new ones based on
statistical multivariate analysis techniques. More specifically, the proposed dimensionality
reduction methods are based on partial least squares regression.
The results of a thorough experimentation show that the quality estimation
systems estimated following the proposed two-step methodology obtain better
prediction accuracy that systems estimated using all the original features.
Moreover, one of the proposed dimensionality reduction methods obtained the
best prediction accuracy with only a fraction of the original features. This
feature reduction ratio is important because it implies a dramatic reduction
of the operating times of the quality estimation system.
An alternative use of current machine translation systems is to embed them
within an interactive editing environment where the system and a human expert
collaborate to generate error-free translations. This interactive machine
translation approach have shown to reduce supervision effort of the user in
comparison to the conventional decoupled post-edition approach. However,
interactive machine translation considers the translation system as a passive
agent in the interaction process. In other words, the system only suggests translations
to the user, who then makes the necessary supervision decisions. As
a result, the user is bound to exhaustively supervise every suggested translation.
This passive approach ensures error-free translations but it also demands
a large amount of supervision effort from the user.
Finally, we study different techniques to improve the productivity of current
interactive machine translation systems. Specifically, we focus on the development
of alternative approaches where the system becomes an active agent
in the interaction process. We propose two different active approaches. On the
one hand, we describe an active interaction approach where the system informs
the user about the reliability of the suggested translations. The hope is that
this information may help the user to locate translation errors thus improving
the overall translation productivity. We propose different scores to measure translation reliability at the word and sentence levels and study the influence
of such information in the productivity of an interactive machine translation
system. Empirical results show that the proposed active interaction protocol
is able to achieve a large reduction in supervision effort while still generating
translations of very high quality. On the other hand, we study an active learning
framework for interactive machine translation. In this case, the system is
not only able to inform the user of which suggested translations should be
supervised, but it is also able to learn from the user-supervised translations to
improve its future suggestions. We develop a value-of-information criterion to
select which automatic translations undergo user supervision. However, given
its high computational complexity, in practice we study different selection
strategies that approximate this optimal criterion. Results of a large scale experimentation
show that the proposed active learning framework is able to
obtain better compromises between the quality of the generated translations
and the human effort required to obtain them. Moreover, in comparison to
a conventional interactive machine translation system, our proposal obtained
translations of twice the quality with the same supervision effort.González Rubio, J. (2014). On the effective deployment of current machine translation technology [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/37888TESI
Planar ESPAR Array Design with Nonsymmetrical Pattern by Means of Finite-Element Method, Domain Decomposition, and Spherical Wave Expansion
The application of a 3D domain decomposition finite-element and spherical mode expansion for the design of planar ESPAR (electronically steerable passive array radiator) made with probe-fed circular microstrip patches is presented in this work. A global generalized scattering matrix (GSM) in terms of spherical modes is obtained analytically from the GSM of the isolated patches by using rotation and translation properties of spherical waves. The whole behaviour of the array is characterized including all the mutual coupling effects between its elements. This procedure has been firstly validated by analyzing an array of monopoles on a ground plane, and then it has been applied to synthesize a prescribed radiation pattern optimizing the reactive loads connected to the feeding ports of the array of circular patches by means of a genetic algorithm
Active learning for interactive machine translation
Translation needs have greatly increased
during the last years. In many situations, text to be translated constitutes an
unbounded stream of data that grows continually with time. An effective approach
to translate text documents is to follow
an interactive-predictive paradigm in which
both the system is guided by the user
and the user is assisted by the system to
generate error-free translations. Unfortunately, when processing such unbounded
data streams even this approach requires an
overwhelming amount of manpower. Is in
this scenario where the use of active learning techniques is compelling. In this work,
we propose different active learning techniques for interactive machine translation.
Results show that for a given translation
quality the use of active learning allows us
to greatly reduce the human effort required
to translate the sentences in the stream.González Rubio, J.; Ortiz MartĂnez, D.; Casacuberta Nolla, F. (2012). Active learning for interactive machine translation. En Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics. 245-254. http://hdl.handle.net/10251/1639524525
Dimensionality reduction methods for machine translation quality estimation
The final publication is available at Springer via http://dx.doi.org/10.1007/s10590-013-9139-3[EN] Quality estimation (QE) for machine translation is usually addressed as a regression problem where a learning model is used to predict a quality score from a (usually highly-redundant) set of features that represent the translation. This redundancy hinders model learning, and thus penalizes the performance of quality estimation systems. We propose different dimensionality reduction methods based on partial least squares regression to overcome this problem, and compare them against several reduction methods previously used in the QE literature. Moreover, we study how the use of such methods influence the performance of different learning models. Experiments carried out on the English-Spanish WMT12 QE task showed that it is possible to improve prediction accuracy while significantly reducing the size of the feature sets.This work supported by the European Union Seventh Framework Program (FP7/2007-2013) under the CasMaCat project (grants agreement no. 287576), by Spanish MICINN under TIASA (TIN2009-14205-C04-02) project, and by the Generalitat Valenciana under grant ALMPR (Prometeo/2009/014).González Rubio, J.; Navarro Cerdán, JR.; Casacuberta Nolla, F. (2013). Dimensionality reduction methods for machine translation quality estimation. Machine Translation. 27(3-4):281-301. https://doi.org/10.1007/s10590-013-9139-3S281301273-4Amaldi E, Kann V (1998) On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems. Theor Comput Sci 209(1–2):237–260Anderson TW (1958) An introduction to multivariate statistical analysis. Wiley, New YorkAvramidis E (2012) Quality estimation for machine translation output using linguistic analysis and decoding features. In: Proceedings of the seventh workshop on statistical machine translation, pp 84–90Bellman RE (1961) Adaptive control processes: a guided tour. Rand Corporation research studies. Princeton University Press, PrincetonBisani M, Ney H (2004) Bootstrap estimates for confidence intervals in asr performance evaluation. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, vol 1, pp 409–412Blatz J, Fitzgerald E, Foster G, Gandrabur S, Goutte C, Kulesza A, Sanchis A, Ueffing N (2004) Confidence estimation for machine translation. In: Proceedings of the international conference on Computational Linguistics, pp 315–321Callison-Burch C, Koehn P, Monz C, Post M, Soricut R, Specia L (2012) Findings of the 2012 workshop on statistical machine translation. In: Proceedings of the seventh workshop on statistical machine translation, pp 10–51Chong I, Jun C (2005) Performance of some variable selection methods when multicollinearity is present. Chemom Intell Lab Syst 78(1–2):103–112Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297Gamon M, Aue A, Smets M (2005) Sentence-Level MT evaluation without reference translations: beyond language modeling. In: Proceedings of the conference of the European Association for Machine TranslationGandrabur S, Foster G (2003) Confidence estimation for text prediction. In: Proceedings of the conference on computational natural language learning, pp 315–321Geladi P, Kowalski BR (1986) Partial least-squares regression: a tutorial. Anal Chim Acta 185(1):1–17González-Rubio J, Ortiz-MartĂnez D, Casacuberta F (2010) Balancing user effort and translation error in interactive machine translation via confidence measures. In: Proceedinss of the meeting of the association for computational linguistics, pp 173–177González-Rubio J, SanchĂs A, Casacuberta F (2012) Prhlt submission to the wmt12 quality estimation task. In: Proceedings of the seventh workshop on statistical machine translation, pp 104–108Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. Machine Learning Research 3:1157–1182Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. SIGKDD Explor Newsl 11(1):10–18Hotelling H (1931) The generalization of Student’s ratio. Ann Math Stat 2(3):360–378Koehn P, Hoang H, Birch A, Callison-Burch C, Federico M, Bertoldi N, Cowan B, Shen W, Moran C, Zens R, Dyer C, Bojar O, Constantin A, Herbst E (2007) Moses: open source toolkit for statistical machine translation. In: Proceedings of the association for computational linguistics, demonstration sessionKohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97(1–2):273–324Pearson K (1901) On lines and planes of closest fit to systems of points in space. Philos Mag 2:559–572Platt JC (1999) Using analytic QP and sparseness to speed training of support vector machines. In: Proceedings of the conference on advances in neural information processing systems II, pp 557–563Quinlan RJ (1992) Learning with continuous classes. In: Proceedings of the Australian joint conference on artificial intelligence, pp 343–348Quirk C (2004) Training a sentence-level machine translation confidence measure. In: Proceedings of conference on language resources and evaluation, pp 825–828Sanchis A, Juan A, Vidal E (2007) Estimation of confidence measures for machine translation. In: Proceedings of the machine translation summit XI, pp 407–412Scott DW, Thompson JR (1983) Probability density estimation in higher dimensions. In: Proceedings of the fifteenth symposium on the interface, computer science and statistics, pp 173–179Soricut R, Echihabi A (2010) TrustRank: inducing trust in automatic translations via ranking. In: Proceedings of the meeting of the association for computational linguistics, pp 612–621Soricut R, Bach N, Wang Z (2012) The SDL language weaver systems in the WMT12 quality estimation shared task. In: Proceedings of the seventh workshop on statistical machine translation. Montreal, Canada, pp 145–151Specia L, Saunders C, Wang Z, Shawe-Taylor J, Turchi M (2009a) Improving the confidence of machine translation quality estimates. In: Proceedings of the machine translation summit XIISpecia L, Turchi M, Cancedda N, Dymetman M, Cristianini N (2009b) Estimating the sentence-level quality of machine translation systems. In: Proceedings of the meeting of the European Association for Machine Translation, pp 28–35Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B 58:267–288Ueffing N, Ney H (2007) Word-level confidence estimation for machine translation. Comput Ling 33:9–40Ueffing N, Macherey K, Ney H (2003) Confidence measures for statistical machine translation. In: Proceedings of the MT summit IX, pp 394–401Wold H (1966) Estimation of principal components and related models by iterative least squares. Academic Press, New Yor
Effect of the solvent on the conformational behavior of the alanine dipeptide deduced from MD simulations
In general, peptides do not exhibit a well-defined conformational profile in solution. However, despite the experimental blurred picture associated with their structure, compelling spectroscopic evidence shows that peptides exhibit local order. The conformational profile of a peptide is the result of a balance between intramolecular interactions between different atoms of the molecule and intermolecular interactions between atoms of the molecule and the solvent. Accordingly, the conformational profile of a peptide will change upon the properties of the solvent it is soaked. To get insight into the balance between intra- and intermolecular interactions on the conformational preferences of the peptide backbone we have studied the conformational profile of the alanine dipeptide in diverse solvents using molecular dynamics as sampling technique. Solvents studied include chloroform, methanol, dimethyl sulfoxide, water and N-methylacetamide. Different treatments of the solvent have been studied in the present work including explicit solvent molecules, a generalized Born model and using the bulk dielectric constant of the solvent. The diverse calculations identify four major conformations with different populations in the diverse solvents: the C7 eq only sampled in chloroform; the C5 or extended conformation; the polyproline (PII) conformation and the right-handed a-helix conformation (aR). The results of present calculations permit to analyze how the balance between intra- and intermolecular interactions explains the populations of the diverse conformations observed.Postprint (published version
Proyectando el futuro: un estudio sobre toma de decisiones vocacionales desde un enfoque narrativo autobiográfico
El trabajo que presentamos se desarrolla en el marco de la educaciĂłn secundaria obligatoria y se centra en la perspectiva de seis alumnos y una alumna con dificultades de aprendizaje procedentes de una pequeña poblaciĂłn rural española. El diseño incluye algunos aspectos novedosos al combinar estrategias de investigaciĂłn e intervenciĂłn basadas en las nociones de mediaciĂłn y re-mediaciĂłn socio-cultural. Se asume un doble objetivo analĂtico y práctico: por un lado, analizar la forma como los estudiantes construyen discursivamente significados identitarios vinculados con la toma de decisiones sobre su futuro; por otro, ofrecer a los orientados herramientas que, a modo de dispositivo re-mediador, faciliten la tarea reflexiva que supone la construcciĂłn y proyecciĂłn de la propia carrera acadĂ©mica y laboral. Se concluye que el uso reflexivo de narrativas autobiográficas permite integrar la orientaciĂłn vocacional en el contexto social, cultural e institucional en que se produce, logrando asĂ una mayor significatividad e impacto en la vida del orientado.We explore the vocational decisions of seven students with learning challenges living in a rural area in Spain and who were completing their final year of compulsory education. Our design combined research and intervention strategies based on notions of sociocultural mediation and re-mediation. Our objectives were both analytical and practical. On the one hand, a sociocultural approach was used to explore how identity meanings are used by the students on a discursive construction related to their vocational choices; on the other hand, students had been provided with tools that facilitated the reflexive act of constructing and projecting their own academic and professional careers. We conclude that the reflexive use of autobiographical narratives makes it possible to provide vocational guidance in social, cultural, and institutional environments, helping to increase its impact on students' lives.In unserer Studie haben wir uns mit den Berufsentscheidungen von sieben schulpflichtigen SchĂĽler/innen mit Lernschwierigkeiten im letzten Jahr ihres Schulbesuchs in einer ländlichen spanischen Region befasst. Unsere Forschungs- und Interventionsstrategien basierten auf Konzepten der soziokulturellen Mediation und Re-Mediation und waren zugleich analytisch und praxisorientiert: Einerseits sollten mittels eines soziokulturellen Zugangs die Identitätskonzepte verstanden werden, die den diskursiven Konstruktionen der zukĂĽnftigen Berufswahl unterlagen. Andererseits gaben wir den SchĂĽler/innen Werkzeuge an die Hand, die die reflexive Konstruktion und Projektion möglicher Berufskarrieren unterstĂĽtzten. Die reflexive Verwendung solcher autobiografischen Narrative kann wichtige Hilfen fĂĽr derartige Entscheidungsprozesse in unterschiedlichen sozialen, kulturellen und institutionellen Umgebungen bieten
Sensitivity Study of the Dynamics of Three-Phase Photovoltaic Inverters With an LCL Grid Filter
[EN] An accurate small-signal model of three-phase photovoltaic (PV) inverters with a high-order grid filter is derived in this paper. The proposed model takes into account the influence of both the inverter operating point and the PV panel characteristics on the inverter dynamic response. A sensitivity study of the control loops to variations of the dc voltage, PV panel transconductance, supplied power, and grid inductance is performed using the proposed small-signal model. Analytical and experimental results carried out on a 100-kW PV inverter are presented.Figueres AmorĂłs, E.; Garcerá, G.; Sandia Paredes, J.; González EspĂn, FJ.; Calvo Rubio, J. (2009). Sensitivity Study of the Dynamics of Three-Phase Photovoltaic Inverters With an LCL Grid Filter. IEEE Transactions on Industrial Electronics. 56(3):706-717. doi:10.1109/TIE.2008.2010175S70671756
Production and marketing characteristics of pig production in the south of the State of Mexico
The paper examines the structure and the operation of pig production in the south of the state of Mexico, it explores the various stages that make up the commercialization process of the activity and the different actors involved, it means, all the process that is followed since the departure of the animal from the farm until its arrival as a final product for the consumer. It is also calculated the marketing margins resulting during the different stages of the process, which are used as indicators of profitability. The study was held in 2007 when a total of 17 producers of pork, two collectors, 28 retailers and 37 consumers of pig meat were polled. The objective of the survey was to gather information about the production process, actors, marketing costs and prices, and then determine the margins of marketing and characterization of pig production in the South of the state of Mexico. The results showed that at current prices, the producers participation in the final price, paid by the consumer, was 26,22 %, while the intermediaries participation was 73,78 %. The total marketing margin averaged 23,93 /kg (98,28 %), while the remaining 0,40 $/kg was for the collectors. The traditional marketing process used in the region is: producer, collector, retailer and final consumer
Estrategias pedagógicas para la sana convivencia desde la cátedra de la paz
El presente estudio consistiĂł en diseñar estrategias pedagĂłgicas para la asignatura cátedra de la paz, que fortalezcan la sana convivencia de los estudiantes del grado sexto de la InstituciĂłn Educativa Integrado JoaquĂn González Camargo del municipio de Sogamoso Boyacá. La investigaciĂłn surge desde la necesidad de fomentar la cultura de la paz y la sana convivencia en las aulas de clase, debido a que en el plan curricular y en la metodologĂa dada, se evidencia poco material pedagĂłgico. Esta investigaciĂłn es de tipo Mixto pues, se utilizaron instrumentos y tĂ©cnicas cuantitativas y cualitativas de recolecciĂłn de informaciĂłn como la observaciĂłn no participante, el cuestionario, la entrevista y aportes de autores que han trabajado sobre el tema. Los instrumentos se aplicaron a 104 estudiantes que corresponden a los estudiantes de grado sexto, cuyas edades oscilan entre 11 y 14 años. Se evidenciĂł que la aplicaciĂłn de estrategias didácticas, ayudan bastante en la formaciĂłn, ya que despiertan el interĂ©s, mejoran conocimientos en los contenidos poco entendibles y, tienen mayor capacidad y fluidez para expresar lo aprendido
Vision-based techniques for automatic marine plankton classification
Plankton are an important component of life on Earth. Since the 19th century, scientists have attempted to quantify species distributions using many techniques, such as direct counting, sizing, and classification with microscopes. Since then, extraordinary work has been performed regarding the development of plankton imaging systems, producing a massive backlog of images that await classification. Automatic image processing and classification approaches are opening new avenues for avoiding time-consuming manual procedures. While some algorithms have been adapted from many other applications for use with plankton, other exciting techniques have been developed exclusively for this issue. Achieving higher accuracy than that of human taxonomists is not yet possible, but an expeditious analysis is essential for discovering the world beyond plankton. Recent studies have shown the imminent development of real-time, in situ plankton image classification systems, which have only been slowed down by the complex implementations of algorithms on low-power processing hardware. This article compiles the techniques that have been proposed for classifying marine plankton, focusing on automatic methods that utilize image processing, from the beginnings of this field to the present day.Funding for open access charge: Universidad de Málaga / CBUA.
Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature.
The authors wish to thank Alonso Hernández-Guerra for his frm support in the
development of oceanographic technology. Special thanks to Laia Armengol for her help in the domain
of plankton. This study has been funded by Feder of the UE through the RES-COAST Mac-Interreg pro ject (MAC2/3.5b/314). We also acknowledge the European Union projects SUMMER (Grant Agreement
817806) and TRIATLAS (Grant Agreement 817578) from the Horizon 2020 Research and Innovation
Programme and the Ministry of Science from the Spanish Government through the Project DESAFĂŤO
(PID2020-118118RB-I00)
- …